Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 53908 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 142 |
| Duplicate rows (%) | 0.3% |
| Total size in memory | 5.9 MiB |
| Average record size in memory | 115.2 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 1 |
Alerts
| Dataset has 142 (0.3%) duplicate rows | Duplicates |
|---|---|
| carat is highly overall correlated with price and 3 other fields | High correlation |
| price is highly overall correlated with carat and 3 other fields | High correlation |
| x is highly overall correlated with carat and 3 other fields | High correlation |
| y is highly overall correlated with carat and 3 other fields | High correlation |
| z is highly overall correlated with carat and 3 other fields | High correlation |
| color has 6774 (12.6%) zeros | Zeros |
| clarity has 732 (1.4%) zeros | Zeros |
Reproduction
| Analysis started | 2023-01-27 08:19:09.906385 |
|---|---|
| Analysis finished | 2023-01-27 08:19:18.491975 |
| Duration | 8.59 seconds |
| Software version | pandas-profiling v3.6.2 |
| Download configuration | config.json |
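The reported duration follows from the two timestamps in the table above. A minimal sketch using only the standard library (the variable names are illustrative and not part of the report):

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-01-27 08:19:09.906385", FMT)
finished = datetime.strptime("2023-01-27 08:19:18.491975", FMT)

# Elapsed wall-clock time between the two timestamps, in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```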
carat
Real number (ℝ)
| Distinct | 268 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.79726942 |
| Minimum | 0.2 |
|---|---|
| Maximum | 3.67 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 842.3 KiB |
Quantile statistics
| Minimum | 0.2 |
|---|---|
| 5-th percentile | 0.3 |
| Q1 | 0.4 |
| median | 0.7 |
| Q3 | 1.04 |
| 95-th percentile | 1.7 |
| Maximum | 3.67 |
| Range | 3.47 |
| Interquartile range (IQR) | 0.64 |
Descriptive statistics
| Standard deviation | 0.47235698 |
|---|---|
| Coefficient of variation (CV) | 0.59246846 |
| Kurtosis | 0.9593992 |
| Mean | 0.79726942 |
| Median Absolute Deviation (MAD) | 0.32 |
| Skewness | 1.0829479 |
| Sum | 42979.2 |
| Variance | 0.22312112 |
| Monotonicity | Not monotonic |
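Several of the carat statistics above are functions of one another, which gives a quick consistency check on the report. The sketch below recomputes them from the rounded values shown in the tables; the tolerance is an assumption chosen to absorb that rounding, and the names are illustrative.

```python
import math

# Values copied from the carat tables above.
n = 53908                      # number of observations
mean, std = 0.79726942, 0.47235698
minimum, maximum = 0.2, 3.67
q1, q3 = 0.4, 1.04

# Each derived quantity paired with the value the report shows.
derived_vs_reported = {
    "range": (maximum - minimum, 3.47),
    "iqr": (q3 - q1, 0.64),
    "cv": (std / mean, 0.59246846),        # coefficient of variation
    "variance": (std ** 2, 0.22312112),
    "sum": (mean * n, 42979.2),
}

# rel_tol is an assumed tolerance for the rounding in the report.
checks = {
    name: math.isclose(derived, reported, rel_tol=1e-6)
    for name, (derived, reported) in derived_vs_reported.items()
}
for name, ok in checks.items():
    print(f"{name}: {'consistent' if ok else 'MISMATCH'}")
```

The same relationships can be checked for any other numeric variable in the report by swapping in its values.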
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.3 | 2604 | 4.8% |
| 0.31 | 2249 | 4.2% |
| 1.01 | 2240 | 4.2% |
| 0.7 | 1981 | 3.7% |
| 0.32 | 1840 | 3.4% |
| 1 | 1556 | 2.9% |
| 0.9 | 1485 | 2.8% |
| 0.41 | 1382 | 2.6% |
| 0.4 | 1299 | 2.4% |
| 0.71 | 1292 | 2.4% |
| Other values (258) | 35980 | 66.7% |
| Value | Count | Frequency (%) |
| 0.2 | 12 | < 0.1% |
| 0.21 | 9 | < 0.1% |
| 0.22 | 5 | < 0.1% |
| 0.23 | 293 | 0.5% |
| 0.24 | 254 | 0.5% |
| 0.25 | 212 | 0.4% |
| 0.26 | 253 | 0.5% |
| 0.27 | 233 | 0.4% |
| 0.28 | 198 | 0.4% |
| 0.29 | 130 | 0.2% |
| Value | Count | Frequency (%) |
| 3.67 | 1 | < 0.1% |
| 3.65 | 1 | < 0.1% |
| 3.51 | 1 | < 0.1% |
| 3.5 | 1 | < 0.1% |
| 3.4 | 1 | < 0.1% |
| 3.24 | 1 | < 0.1% |
| 3.22 | 1 | < 0.1% |
| 3.11 | 1 | < 0.1% |
| 3.05 | 1 | < 0.1% |
| 3.04 | 2 | < 0.1% |
cut
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 842.3 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 53908 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 3 |
| 3rd row | 1 |
| 4th row | 3 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 21544 | 40.0% |
| 3 | 13777 | 25.6% |
| 4 | 12079 | 22.4% |
| 1 | 4902 | 9.1% |
| 0 | 1606 | 3.0% |
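The Frequency (%) column is each count divided by the number of observations. A short sketch of that calculation for cut follows; the "< 0.1%" rule for very small shares is an assumption that mirrors how the report displays them, and `fmt_pct` is a hypothetical helper, not a pandas-profiling function.

```python
# Counts copied from the "Common Values" table for cut.
counts = {"2": 21544, "3": 13777, "4": 12079, "1": 4902, "0": 1606}
total = sum(counts.values())  # equals the number of observations


def fmt_pct(count: int, total: int) -> str:
    """Format a share of the total as the report displays it (assumed rule)."""
    share = 100 * count / total
    if 0 < share < 0.1:
        return "< 0.1%"
    return f"{share:.1f}%"


freq = {value: fmt_pct(count, total) for value, count in counts.items()}
for value, pct in freq.items():
    print(value, counts[value], pct)
```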
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 21544 | 40.0% |
| 3 | 13777 | 25.6% |
| 4 | 12079 | 22.4% |
| 1 | 4902 | 9.1% |
| 0 | 1606 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 21544 | 40.0% |
| 3 | 13777 | 25.6% |
| 4 | 12079 | 22.4% |
| 1 | 4902 | 9.1% |
| 0 | 1606 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 53908 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 21544 | 40.0% |
| 3 | 13777 | 25.6% |
| 4 | 12079 | 22.4% |
| 1 | 4902 | 9.1% |
| 0 | 1606 | 3.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 53908 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 21544 | 40.0% |
| 3 | 13777 | 25.6% |
| 4 | 12079 | 22.4% |
| 1 | 4902 | 9.1% |
| 0 | 1606 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 53908 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 21544 | 40.0% |
| 3 | 13777 | 25.6% |
| 4 | 12079 | 22.4% |
| 1 | 4902 | 9.1% |
| 0 | 1606 | 3.0% |
color
Real number (ℝ)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.5936967 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 6774 |
| Zeros (%) | 12.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 631.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.7011154 |
|---|---|
| Coefficient of variation (CV) | 0.65586522 |
| Kurtosis | -0.86672627 |
| Mean | 2.5936967 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.18970512 |
| Sum | 139821 |
| Variance | 2.8937937 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) |
| 3 | 11284 | 20.9% |
| 1 | 9795 | 18.2% |
| 2 | 9537 | 17.7% |
| 4 | 8295 | 15.4% |
| 0 | 6774 | 12.6% |
| 5 | 5418 | 10.1% |
| 6 | 2805 | 5.2% |
| Value | Count | Frequency (%) |
| 0 | 6774 | 12.6% |
| 1 | 9795 | 18.2% |
| 2 | 9537 | 17.7% |
| 3 | 11284 | 20.9% |
| 4 | 8295 | 15.4% |
| 5 | 5418 | 10.1% |
| 6 | 2805 | 5.2% |
| Value | Count | Frequency (%) |
| 6 | 2805 | 5.2% |
| 5 | 5418 | 10.1% |
| 4 | 8295 | 15.4% |
| 3 | 11284 | 20.9% |
| 2 | 9537 | 17.7% |
| 1 | 9795 | 18.2% |
| 0 | 6774 | 12.6% |
clarity
Real number (ℝ)
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.8359427 |
| Minimum | 0 |
|---|---|
| Maximum | 7 |
| Zeros | 732 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 631.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.7242259 |
|---|---|
| Coefficient of variation (CV) | 0.44949208 |
| Kurtosis | -0.82231017 |
| Mean | 3.8359427 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.17564092 |
| Sum | 206788 |
| Variance | 2.9729549 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=8)
| Value | Count | Frequency (%) |
| 2 | 13061 | 24.2% |
| 5 | 12254 | 22.7% |
| 3 | 9184 | 17.0% |
| 4 | 8167 | 15.1% |
| 7 | 5066 | 9.4% |
| 6 | 3654 | 6.8% |
| 1 | 1790 | 3.3% |
| 0 | 732 | 1.4% |
| Value | Count | Frequency (%) |
| 0 | 732 | 1.4% |
| 1 | 1790 | 3.3% |
| 2 | 13061 | 24.2% |
| 3 | 9184 | 17.0% |
| 4 | 8167 | 15.1% |
| 5 | 12254 | 22.7% |
| 6 | 3654 | 6.8% |
| 7 | 5066 | 9.4% |
| Value | Count | Frequency (%) |
| 7 | 5066 | 9.4% |
| 6 | 3654 | 6.8% |
| 5 | 12254 | 22.7% |
| 4 | 8167 | 15.1% |
| 3 | 9184 | 17.0% |
| 2 | 13061 | 24.2% |
| 1 | 1790 | 3.3% |
| 0 | 732 | 1.4% |
depth
Real number (ℝ)
| Distinct | 184 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 61.749373 |
| Minimum | 43 |
|---|---|
| Maximum | 79 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 842.3 KiB |
Quantile statistics
| Minimum | 43 |
|---|---|
| 5-th percentile | 59.3 |
| Q1 | 61 |
| median | 61.8 |
| Q3 | 62.5 |
| 95-th percentile | 63.8 |
| Maximum | 79 |
| Range | 36 |
| Interquartile range (IQR) | 1.5 |
Descriptive statistics
| Standard deviation | 1.4321416 |
|---|---|
| Coefficient of variation (CV) | 0.023192812 |
| Kurtosis | 5.7500791 |
| Mean | 61.749373 |
| Median Absolute Deviation (MAD) | 0.7 |
| Skewness | -0.082274287 |
| Sum | 3328785.2 |
| Variance | 2.0510295 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 62 | 2239 | 4.2% |
| 61.9 | 2162 | 4.0% |
| 61.8 | 2075 | 3.8% |
| 62.2 | 2038 | 3.8% |
| 62.1 | 2019 | 3.7% |
| 61.6 | 1955 | 3.6% |
| 62.3 | 1940 | 3.6% |
| 61.7 | 1904 | 3.5% |
| 62.4 | 1792 | 3.3% |
| 61.5 | 1719 | 3.2% |
| Other values (174) | 34065 | 63.2% |
| Value | Count | Frequency (%) |
| 43 | 2 | < 0.1% |
| 44 | 1 | < 0.1% |
| 50.8 | 1 | < 0.1% |
| 51 | 1 | < 0.1% |
| 52.2 | 1 | < 0.1% |
| 52.3 | 1 | < 0.1% |
| 52.7 | 1 | < 0.1% |
| 53 | 1 | < 0.1% |
| 53.1 | 1 | < 0.1% |
| 53.2 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 79 | 2 | < 0.1% |
| 78.2 | 1 | < 0.1% |
| 73.6 | 1 | < 0.1% |
| 72.9 | 1 | < 0.1% |
| 72.2 | 1 | < 0.1% |
| 71.8 | 1 | < 0.1% |
| 71.6 | 2 | < 0.1% |
| 71.3 | 1 | < 0.1% |
| 71.2 | 1 | < 0.1% |
| 71 | 1 | < 0.1% |
table
Real number (ℝ)
| Distinct | 127 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 57.456775 |
| Minimum | 43 |
|---|---|
| Maximum | 95 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 842.3 KiB |
Quantile statistics
| Minimum | 43 |
|---|---|
| 5-th percentile | 54 |
| Q1 | 56 |
| median | 57 |
| Q3 | 59 |
| 95-th percentile | 61 |
| Maximum | 95 |
| Range | 52 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.2339938 |
|---|---|
| Coefficient of variation (CV) | 0.038881295 |
| Kurtosis | 2.8031094 |
| Mean | 57.456775 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.7969554 |
| Sum | 3097379.8 |
| Variance | 4.9907284 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 56 | 9878 | 18.3% |
| 57 | 9722 | 18.0% |
| 58 | 8364 | 15.5% |
| 59 | 6564 | 12.2% |
| 55 | 6267 | 11.6% |
| 60 | 4239 | 7.9% |
| 54 | 2592 | 4.8% |
| 61 | 2278 | 4.2% |
| 62 | 1272 | 2.4% |
| 63 | 588 | 1.1% |
| Other values (117) | 2144 | 4.0% |
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 44 | 1 | < 0.1% |
| 49 | 2 | < 0.1% |
| 50 | 2 | < 0.1% |
| 50.1 | 1 | < 0.1% |
| 51 | 9 | < 0.1% |
| 51.6 | 1 | < 0.1% |
| 52 | 56 | 0.1% |
| 52.4 | 1 | < 0.1% |
| 52.8 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 95 | 1 | < 0.1% |
| 79 | 1 | < 0.1% |
| 76 | 1 | < 0.1% |
| 73 | 4 | < 0.1% |
| 71 | 1 | < 0.1% |
| 70 | 9 | < 0.1% |
| 69 | 9 | < 0.1% |
| 68 | 21 | < 0.1% |
| 67 | 41 | < 0.1% |
| 66 | 91 | 0.2% |
price
Real number (ℝ)
| Distinct | 11593 |
|---|---|
| Distinct (%) | 21.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3929.2491 |
| Minimum | 326 |
|---|---|
| Maximum | 18823 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 842.3 KiB |
Quantile statistics
| Minimum | 326 |
|---|---|
| 5-th percentile | 544 |
| Q1 | 949 |
| median | 2400.5 |
| Q3 | 5321 |
| 95-th percentile | 13090.6 |
| Maximum | 18823 |
| Range | 18497 |
| Interquartile range (IQR) | 4372 |
Descriptive statistics
| Standard deviation | 3985.0936 |
|---|---|
| Coefficient of variation (CV) | 1.0142125 |
| Kurtosis | 2.1807202 |
| Mean | 3929.2491 |
| Median Absolute Deviation (MAD) | 1669.5 |
| Skewness | 1.6186343 |
| Sum | 2.1181796 × 10⁸ |
| Variance | 15880971 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 605 | 132 | 0.2% |
| 802 | 127 | 0.2% |
| 625 | 126 | 0.2% |
| 828 | 125 | 0.2% |
| 776 | 124 | 0.2% |
| 789 | 121 | 0.2% |
| 698 | 121 | 0.2% |
| 544 | 120 | 0.2% |
| 666 | 114 | 0.2% |
| 552 | 113 | 0.2% |
| Other values (11583) | 52685 | 97.7% |
| Value | Count | Frequency (%) |
| 326 | 2 | < 0.1% |
| 327 | 1 | < 0.1% |
| 334 | 1 | < 0.1% |
| 335 | 1 | < 0.1% |
| 336 | 2 | < 0.1% |
| 337 | 2 | < 0.1% |
| 338 | 1 | < 0.1% |
| 339 | 1 | < 0.1% |
| 340 | 1 | < 0.1% |
| 342 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 18823 | 1 | < 0.1% |
| 18818 | 1 | < 0.1% |
| 18806 | 1 | < 0.1% |
| 18804 | 1 | < 0.1% |
| 18803 | 1 | < 0.1% |
| 18797 | 1 | < 0.1% |
| 18795 | 2 | < 0.1% |
| 18791 | 2 | < 0.1% |
| 18787 | 1 | < 0.1% |
| 18784 | 1 | < 0.1% |
x
Real number (ℝ)
| Distinct | 547 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.7310318 |
| Minimum | 3.73 |
|---|---|
| Maximum | 9.86 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 842.3 KiB |
Quantile statistics
| Minimum | 3.73 |
|---|---|
| 5-th percentile | 4.29 |
| Q1 | 4.71 |
| median | 5.7 |
| Q3 | 6.54 |
| 95-th percentile | 7.66 |
| Maximum | 9.86 |
| Range | 6.13 |
| Interquartile range (IQR) | 1.83 |
Descriptive statistics
| Standard deviation | 1.1184522 |
|---|---|
| Coefficient of variation (CV) | 0.19515722 |
| Kurtosis | -0.72348831 |
| Mean | 5.7310318 |
| Median Absolute Deviation (MAD) | 0.92 |
| Skewness | 0.39365217 |
| Sum | 308948.46 |
| Variance | 1.2509354 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 4.37 | 448 | 0.8% |
| 4.34 | 437 | 0.8% |
| 4.33 | 429 | 0.8% |
| 4.38 | 428 | 0.8% |
| 4.32 | 425 | 0.8% |
| 4.35 | 407 | 0.8% |
| 4.39 | 388 | 0.7% |
| 4.31 | 387 | 0.7% |
| 4.36 | 386 | 0.7% |
| 4.4 | 373 | 0.7% |
| Other values (537) | 49800 | 92.4% |
| Value | Count | Frequency (%) |
| 3.73 | 2 | < 0.1% |
| 3.74 | 1 | < 0.1% |
| 3.76 | 1 | < 0.1% |
| 3.77 | 1 | < 0.1% |
| 3.79 | 2 | < 0.1% |
| 3.81 | 3 | < 0.1% |
| 3.82 | 2 | < 0.1% |
| 3.83 | 3 | < 0.1% |
| 3.84 | 4 | < 0.1% |
| 3.85 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 9.86 | 1 | < 0.1% |
| 9.66 | 1 | < 0.1% |
| 9.65 | 1 | < 0.1% |
| 9.54 | 1 | < 0.1% |
| 9.53 | 1 | < 0.1% |
| 9.51 | 1 | < 0.1% |
| 9.49 | 1 | < 0.1% |
| 9.44 | 3 | < 0.1% |
| 9.42 | 2 | < 0.1% |
| 9.41 | 1 | < 0.1% |
y
Real number (ℝ)
| Distinct | 543 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.732866 |
| Minimum | 3.68 |
|---|---|
| Maximum | 9.81 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 842.3 KiB |
Quantile statistics
| Minimum | 3.68 |
|---|---|
| 5-th percentile | 4.3 |
| Q1 | 4.72 |
| median | 5.71 |
| Q3 | 6.54 |
| 95-th percentile | 7.64 |
| Maximum | 9.81 |
| Range | 6.13 |
| Interquartile range (IQR) | 1.82 |
Descriptive statistics
| Standard deviation | 1.1103599 |
|---|---|
| Coefficient of variation (CV) | 0.19368322 |
| Kurtosis | -0.73491855 |
| Mean | 5.732866 |
| Median Absolute Deviation (MAD) | 0.92 |
| Skewness | 0.38838966 |
| Sum | 309047.34 |
| Variance | 1.2328992 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 4.34 | 437 | 0.8% |
| 4.37 | 435 | 0.8% |
| 4.35 | 425 | 0.8% |
| 4.33 | 421 | 0.8% |
| 4.32 | 414 | 0.8% |
| 4.39 | 407 | 0.8% |
| 4.38 | 406 | 0.8% |
| 4.4 | 387 | 0.7% |
| 4.31 | 386 | 0.7% |
| 4.41 | 384 | 0.7% |
| Other values (533) | 49806 | 92.4% |
| Value | Count | Frequency (%) |
| 3.68 | 1 | < 0.1% |
| 3.71 | 2 | < 0.1% |
| 3.72 | 1 | < 0.1% |
| 3.73 | 1 | < 0.1% |
| 3.75 | 1 | < 0.1% |
| 3.77 | 2 | < 0.1% |
| 3.78 | 5 | < 0.1% |
| 3.8 | 1 | < 0.1% |
| 3.81 | 1 | < 0.1% |
| 3.82 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9.81 | 1 | < 0.1% |
| 9.63 | 1 | < 0.1% |
| 9.59 | 1 | < 0.1% |
| 9.48 | 1 | < 0.1% |
| 9.46 | 1 | < 0.1% |
| 9.42 | 1 | < 0.1% |
| 9.4 | 1 | < 0.1% |
| 9.38 | 2 | < 0.1% |
| 9.37 | 1 | < 0.1% |
| 9.34 | 1 | < 0.1% |
z
Real number (ℝ)
| Distinct | 363 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5392049 |
| Minimum | 2.06 |
|---|---|
| Maximum | 6.38 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 842.3 KiB |
Quantile statistics
| Minimum | 2.06 |
|---|---|
| 5-th percentile | 2.65 |
| Q1 | 2.91 |
| median | 3.53 |
| Q3 | 4.04 |
| 95-th percentile | 4.73 |
| Maximum | 6.38 |
| Range | 4.32 |
| Interquartile range (IQR) | 1.13 |
Descriptive statistics
| Standard deviation | 0.6907805 |
|---|---|
| Coefficient of variation (CV) | 0.19517957 |
| Kurtosis | -0.72451483 |
| Mean | 3.5392049 |
| Median Absolute Deviation (MAD) | 0.57 |
| Skewness | 0.38926874 |
| Sum | 190791.46 |
| Variance | 0.4771777 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 2.7 | 767 | 1.4% |
| 2.69 | 748 | 1.4% |
| 2.71 | 738 | 1.4% |
| 2.68 | 730 | 1.4% |
| 2.72 | 697 | 1.3% |
| 2.67 | 649 | 1.2% |
| 2.73 | 612 | 1.1% |
| 2.66 | 555 | 1.0% |
| 2.74 | 548 | 1.0% |
| 4.02 | 538 | 1.0% |
| Other values (353) | 47326 | 87.8% |
| Value | Count | Frequency (%) |
| 2.06 | 1 | < 0.1% |
| 2.24 | 1 | < 0.1% |
| 2.25 | 1 | < 0.1% |
| 2.26 | 1 | < 0.1% |
| 2.27 | 1 | < 0.1% |
| 2.28 | 1 | < 0.1% |
| 2.29 | 1 | < 0.1% |
| 2.3 | 2 | < 0.1% |
| 2.31 | 6 | < 0.1% |
| 2.32 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 6.38 | 1 | < 0.1% |
| 6.27 | 1 | < 0.1% |
| 6.16 | 1 | < 0.1% |
| 6.13 | 1 | < 0.1% |
| 6.03 | 2 | < 0.1% |
| 5.98 | 1 | < 0.1% |
| 5.97 | 1 | < 0.1% |
| 5.92 | 1 | < 0.1% |
| 5.91 | 1 | < 0.1% |
| 5.9 | 2 | < 0.1% |
| | carat | color | clarity | depth | table | price | x | y | z | cut |
|---|---|---|---|---|---|---|---|---|---|---|
| carat | 1.000 | 0.249 | -0.216 | 0.030 | 0.195 | 0.963 | 0.997 | 0.996 | 0.995 | 0.113 |
| color | 0.249 | 1.000 | -0.023 | 0.049 | 0.028 | 0.150 | 0.245 | 0.245 | 0.251 | 0.036 |
| clarity | -0.216 | -0.023 | 1.000 | -0.053 | -0.085 | -0.116 | -0.214 | -0.212 | -0.218 | 0.142 |
| depth | 0.030 | 0.049 | -0.053 | 1.000 | -0.245 | 0.010 | -0.023 | -0.025 | 0.103 | 0.406 |
| table | 0.195 | 0.028 | -0.085 | -0.245 | 1.000 | 0.172 | 0.202 | 0.196 | 0.160 | 0.290 |
| price | 0.963 | 0.150 | -0.116 | 0.010 | 0.172 | 1.000 | 0.964 | 0.963 | 0.959 | 0.093 |
| x | 0.997 | 0.245 | -0.214 | -0.023 | 0.202 | 0.964 | 1.000 | 0.998 | 0.989 | 0.133 |
| y | 0.996 | 0.245 | -0.212 | -0.025 | 0.196 | 0.963 | 0.998 | 1.000 | 0.988 | 0.136 |
| z | 0.995 | 0.251 | -0.218 | 0.103 | 0.160 | 0.959 | 0.989 | 0.988 | 1.000 | 0.134 |
| cut | 0.113 | 0.036 | 0.142 | 0.406 | 0.290 | 0.093 | 0.133 | 0.136 | 0.134 | 1.000 |
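The "High correlation" alerts can be read off this matrix by flagging every off-diagonal coefficient whose absolute value exceeds a cutoff. The sketch below does that in plain Python. The cutoff of 0.5 is an assumption: any value between roughly 0.41 and 0.95 yields the same five flagged variables for this matrix, and the report does not state which threshold it used. `high_correlations` is a hypothetical helper, not a pandas-profiling function.

```python
# Correlation matrix copied from the table above (rows and columns in this order).
names = ["carat", "color", "clarity", "depth", "table", "price", "x", "y", "z", "cut"]
matrix = [
    [1.000, 0.249, -0.216, 0.030, 0.195, 0.963, 0.997, 0.996, 0.995, 0.113],
    [0.249, 1.000, -0.023, 0.049, 0.028, 0.150, 0.245, 0.245, 0.251, 0.036],
    [-0.216, -0.023, 1.000, -0.053, -0.085, -0.116, -0.214, -0.212, -0.218, 0.142],
    [0.030, 0.049, -0.053, 1.000, -0.245, 0.010, -0.023, -0.025, 0.103, 0.406],
    [0.195, 0.028, -0.085, -0.245, 1.000, 0.172, 0.202, 0.196, 0.160, 0.290],
    [0.963, 0.150, -0.116, 0.010, 0.172, 1.000, 0.964, 0.963, 0.959, 0.093],
    [0.997, 0.245, -0.214, -0.023, 0.202, 0.964, 1.000, 0.998, 0.989, 0.133],
    [0.996, 0.245, -0.212, -0.025, 0.196, 0.963, 0.998, 1.000, 0.988, 0.136],
    [0.995, 0.251, -0.218, 0.103, 0.160, 0.959, 0.989, 0.988, 1.000, 0.134],
    [0.113, 0.036, 0.142, 0.406, 0.290, 0.093, 0.133, 0.136, 0.134, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff; not stated in the report


def high_correlations(names, matrix, threshold):
    """Map each variable to the other variables with |r| >= threshold."""
    flagged = {}
    for i, name in enumerate(names):
        partners = [
            names[j]
            for j, r in enumerate(matrix[i])
            if j != i and abs(r) >= threshold
        ]
        if partners:
            flagged[name] = partners
    return flagged


flagged = high_correlations(names, matrix, THRESHOLD)
for name, partners in flagged.items():
    print(f"{name} is highly correlated with {', '.join(partners)}")
```

Under this assumed cutoff, each of carat, price, x, y and z is flagged with the other four, which matches the "and 3 other fields" wording of the alerts.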
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
| | carat | cut | color | clarity | depth | table | price | x | y | z |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.23 | 2 | 1 | 3 | 61.5 | 55.0 | 326 | 3.95 | 3.98 | 2.43 |
| 1 | 0.21 | 3 | 1 | 2 | 59.8 | 61.0 | 326 | 3.89 | 3.84 | 2.31 |
| 2 | 0.23 | 1 | 1 | 4 | 56.9 | 65.0 | 327 | 4.05 | 4.07 | 2.31 |
| 3 | 0.29 | 3 | 5 | 5 | 62.4 | 58.0 | 334 | 4.20 | 4.23 | 2.63 |
| 4 | 0.31 | 1 | 6 | 3 | 63.3 | 58.0 | 335 | 4.34 | 4.35 | 2.75 |
| 5 | 0.24 | 4 | 6 | 7 | 62.8 | 57.0 | 336 | 3.94 | 3.96 | 2.48 |
| 6 | 0.24 | 4 | 5 | 6 | 62.3 | 57.0 | 336 | 3.95 | 3.98 | 2.47 |
| 7 | 0.26 | 4 | 4 | 2 | 61.9 | 55.0 | 337 | 4.07 | 4.11 | 2.53 |
| 8 | 0.22 | 0 | 1 | 5 | 65.1 | 61.0 | 337 | 3.87 | 3.78 | 2.49 |
| 9 | 0.23 | 4 | 4 | 4 | 59.4 | 61.0 | 338 | 4.00 | 4.05 | 2.39 |
| | carat | cut | color | clarity | depth | table | price | x | y | z |
|---|---|---|---|---|---|---|---|---|---|---|
| 53930 | 0.71 | 3 | 1 | 2 | 60.5 | 55.0 | 2756 | 5.79 | 5.74 | 3.49 |
| 53931 | 0.71 | 3 | 2 | 2 | 59.8 | 62.0 | 2756 | 5.74 | 5.73 | 3.43 |
| 53932 | 0.70 | 4 | 1 | 5 | 60.5 | 59.0 | 2757 | 5.71 | 5.76 | 3.47 |
| 53933 | 0.70 | 4 | 1 | 5 | 61.2 | 59.0 | 2757 | 5.69 | 5.72 | 3.49 |
| 53934 | 0.72 | 3 | 0 | 2 | 62.7 | 59.0 | 2757 | 5.69 | 5.73 | 3.58 |
| 53935 | 0.72 | 2 | 0 | 2 | 60.8 | 57.0 | 2757 | 5.75 | 5.76 | 3.50 |
| 53936 | 0.72 | 1 | 0 | 2 | 63.1 | 55.0 | 2757 | 5.69 | 5.75 | 3.61 |
| 53937 | 0.70 | 4 | 0 | 2 | 62.8 | 60.0 | 2757 | 5.66 | 5.68 | 3.56 |
| 53938 | 0.86 | 3 | 4 | 3 | 61.0 | 58.0 | 2757 | 6.15 | 6.12 | 3.74 |
| 53939 | 0.75 | 2 | 0 | 3 | 62.2 | 55.0 | 2757 | 5.83 | 5.87 | 3.64 |
Most frequently occurring
| | carat | cut | color | clarity | depth | table | price | x | y | z | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 83 | 0.79 | 2 | 3 | 2 | 62.3 | 57.0 | 2898 | 5.90 | 5.85 | 3.66 | 5 |
| 0 | 0.30 | 1 | 6 | 4 | 63.4 | 57.0 | 394 | 4.23 | 4.26 | 2.69 | 2 |
| 1 | 0.30 | 2 | 3 | 1 | 62.1 | 55.0 | 863 | 4.32 | 4.35 | 2.69 | 2 |
| 2 | 0.30 | 2 | 3 | 5 | 63.0 | 55.0 | 675 | 4.31 | 4.29 | 2.71 | 2 |
| 3 | 0.30 | 2 | 4 | 2 | 62.2 | 57.0 | 450 | 4.26 | 4.29 | 2.66 | 2 |
| 4 | 0.30 | 2 | 4 | 2 | 62.2 | 57.0 | 450 | 4.27 | 4.28 | 2.66 | 2 |
| 5 | 0.30 | 3 | 0 | 2 | 62.2 | 58.0 | 709 | 4.31 | 4.28 | 2.67 | 2 |
| 6 | 0.30 | 4 | 3 | 5 | 63.0 | 55.0 | 526 | 4.29 | 4.31 | 2.71 | 2 |
| 7 | 0.30 | 4 | 6 | 4 | 63.4 | 57.0 | 506 | 4.26 | 4.23 | 2.69 | 2 |
| 8 | 0.31 | 1 | 0 | 2 | 63.5 | 56.0 | 571 | 4.29 | 4.31 | 2.73 | 2 |
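A table like the one above can be produced by grouping identical rows and keeping the groups that occur more than once. The sketch below shows the idea with the standard library on a toy list of rows. The repeated row is illustrative and not taken from the real data, and the exact counting convention the report uses for its "Duplicate rows" total is not stated here.

```python
from collections import Counter

# Toy rows in the column order carat, cut, color, clarity, depth, table, price, x, y, z.
# The third row is an illustrative exact repeat of the first.
rows = [
    (0.23, 2, 1, 3, 61.5, 55.0, 326, 3.95, 3.98, 2.43),
    (0.21, 3, 1, 2, 59.8, 61.0, 326, 3.89, 3.84, 2.31),
    (0.23, 2, 1, 3, 61.5, 55.0, 326, 3.95, 3.98, 2.43),
]

# Count how often each distinct row occurs, then keep the repeated ones.
occurrences = Counter(rows)
duplicates = {row: n for row, n in occurrences.items() if n > 1}

# Most frequent duplicate groups first, analogous to the "# duplicates" column.
for row, n in sorted(duplicates.items(), key=lambda item: -item[1]):
    print(row, n)
```

Rows are stored as tuples so they are hashable and compare by exact value; two rows that differ in any single column, such as the rows indexed 3 and 4 in the table above, land in separate groups.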